
SUBSTITUTED NUCLEIC AC ID MIMICS 

^ FIELD OF THE INVENTION 

This invention is directed to the synthesis and use 
of nucleic acid mimics containing one or more heterocyclic base 
5 moieties substituted by chemical groups in order to diminish 
or prevent the formation of triplexes. This effect can be used 
to design antisense or probe reagents that avoid forming 
triplexes • 



BACKGROUND OF THE INVENTION 

10 In the art, there are known several nucleic acid 

mimics having nucleobases bound to backbones other than the 
naturally occurring ribonucleic acid or deoxyribonucleic acid 
backbones having the ability to bind to nucleic acids having 
a nucleobase secjuence complementary to the base sequence of the 

15 nucleic acid mimic. Among these, only the peptide nucleic 
acids (PNA's) as described, for example, in WO 92/20702 have 
demonstrated a likelihood for potential use as therapeutic and 
diagnostic reagents. This may be due to their ability to bind 
nucleic acids (NAs) of complementary nucleobase sequence with 

20 a higher affinity than shown by the corresponding wild-type 
nucleic acid. 

One of the unique properties of PNAs is their ability 
to form PNA2-NA triplexes that are more stable than the 
corresponding PNA-NA duplexes. This ability can be used 
25 advantageously for various purposes including PGR clamping 
(WO 93/25706) • However, there are some drawbacks for 
applications that require sequence selection, because such 
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selection would be biased for triplex forming sequences. 

Therefore, there is a need for PNAs that do not form such 
triplexes. 



OBJECTS OF THE INVENTION 

5 It is an object of this invention to provide 

substituted nucleic acid mimics that do not preferentially form 
triplexes with nucleic acids. 

It is a further object of this invention to provide 
methods for sequence selective determination of nucleic acids. 
10 It is yet a further object of this invention to 

provide therapeutic, diagnostic and research reagents that can 
modulate the expression of nucleic acids which encode proteins 
suspected of causing or indicating the existence of a disease 
state . 



15 BRIEF DESCRIPTION OF THE INVENTION 

In accordance with this invention there are provided 
nucleic acid mimics containing one or more heterocyclic bases 
substituted by a sterically bulky substituent at a position 
which is 1, 2 or 3 atoms removed from the atom of the base 

20 which is attached to the backbone. 

Further there are provided methods for disfavouring 
the formation of triplex structures comprising a nucleic acid 
strand and two strands of a nucleic acid mimic, having a base 
sequence complementary to the nucleic acid strand. Such 

25 methods include incubating a mixture of the nucleic acid and 
the nucleic acid mimic under conditions suitable for forming 
a nucleic acid/nucleic acid mimic duplex. The formation of 
triplexes is avoided by providing sterically bulky substituents 
on the nucleic acid mimic which are located at positions that 

30 would be in close proximity to each other if bound to nucleic 
acid in a triplex. 

In accordance with this invention there are provided 
methods for the determination of a nucleic acid by providing 
a nucleic acid mimic substituted at positions which are 1, 2 

35 or 3 atoms removed from the atom of the base which is attached 
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to the backbone. Said nucleic acid mimic is incubated with the 
nucleic acid under conditions suitable for the formation of a 
duplex between the nucleic acid mimic and the nucleic acid. 
The occurrence of the duplex is related to the identity or 
existence of the nucleic acid. 

The present invention provides nucleic acid mimics 
for modulating the expression of nucleic acids that encode 
proteins which are suspected of producing a disease state in 
mammals. The nucleic acid mimics of this invention can be used 
in therapeutics, diagnostics and as research reagents. 

One favourable aspect of this invention is that 
nucleic acid mimics substituted as described herein 
substantially retain the ability to form duplexes with good 
efficiency and discrimination comparable to the corresponding 
unsubstituted nucleic acid mimic. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1 is a schematic illustrating an exemplary 
synthesis of a PNA monomer containing cytosine substituted at 
the position. 

20 Figure 2 is a schematic illustrating the Watson-Crick 

base pairing between substituted cytosine of a PNA and 
guanos ine of a DNA. 

DETAILED DESCRIPTION OF THE INVENTION 

In accordance with this invention, novel compounds 
25 are provided that are useful for disfavouring the formation of 
triplexes with nucleic acids. A nucleic acid mimic in 
accordance with the invention is a molecule having a sequence 
of modified heterocyclic bases, preferably naturally occurring 
bases, e.g. those which occur in "wild-type" nucleic acids, 
30 bound to a non-naturally occurring backbone. The nucleic acid 
mimics bind to a nucleic acid having a complementary base 
sequence through base pairing. 

Preferred nucleic acid mimics are molecules wherein 
the base moieties are bound to the backbone via an amine 
35 nitrogen atom of the backbone. Preferred backbone structures 
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for the mimics are described in WO 92/20702, United States 
Patent application Serial No. 08/054,363, filed April 26, 1993, 
United States Patent application Serial No. 08/319,411, filed 
October 6, 1994 and United States Patent application Serial No. 
5 08/366,231, filed December 28, 1994. The above-referenced 
disclosures are herein incorporated by reference. 

Heterocyclic bases of the nucleic acid mimics of the 
present invention are heterocyclic moieties that are able to 
base pair with nucleobases of a nucleic acid by hydrogen 

10 bonding. In the case of triplex formation, two kinds of 
interactions are involved: Watson-Crick binding and Hoogsteen 
binding. The formation of triplexes between PNA and NA is 
described in WO 95/01370. 

The term "heterocyclic moiety" or "heterocyclic base" 

15 includes the naturally occurring purine and pyrimidine 
nucleobases. For the purpose of this invention, the term 
"pyrimidine" refers to any 1,3-diazine irrespective of its 
substituents. The naturally occurring pyrimidine nucleobases 
are cytosine, thymine and uracil. Naturally occurring purine 

20 nucleobases include adenine and guanine. The term 

"heterocyclic moiety" or "heterocyclic base" also includes non- 
naturally occurring nucleobases. An example of a non-naturally 
occurring base is a base in which any of the ring atoms of the 
nucleobases is replaced by another atom. For example, CH may 

25 be replaced by N and vice versa. Such modifications can occur 
at more than one position. Another example of a non-naturally 
occurring base is a base in which the 2- and 4 -substituents of 
a naturally occurring base are reversed. Structures of 
naturally and non-naturally occurring pyrimidine bases are 

3 0 shown below (the third structure from the left is that of a 
non-naturally occurring pyrimidine base known as pseudo- 
isocytosine) : 
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In the invention^ the heterocyclic moiety is attached 
to the backbone at a specific ring position of the heterocycle. 
In the case of substituted naturally occurring nucleobases, 
this position is preferably occupied by a nitrogen atom. 
5 According to this invention, the sterically bulky substituent 
can be attached to the heterocyclic moiety at a position which 
is 1, 2 or 3 atoms removed from the position of attachment of 
the heterocyclic moiety to the backbone. In case of the 
pyrimidine bases, positions conventionally numbered as ring 

10 position 4, 5 and 6 are preferred. The 4-position is most 
preferred for attaching a bulky substituent. Some effect on 
triplex formation may also occur when the substituent is 
attached to the 5- and 6-positions, but in this case, the 
substituents should be sterically bulkier than substituents 

15 located at position 4. In the case of non-naturally occurring 
bases, positions corresponding to pyrimidine positions 4, 5 and 
6 in their spatial orientation are also preferred. In case of 
substitution on the 5-position of a non-naturally occurring 
base, the triplex formation is pH dependent as it is for a 

20 naturally occurring base such as cytosine. Duplex formation 
is likely not effected by pH in any case. 
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Shown above are formulae of heterocyclic bases having 
substituents designated R. Each R can independently be 
-N0,-N02, -SO3, -CN, -OH, -SH, -POj^', -COOH, -R' , -F, -CI, -Br, 
-I, -O-R', -S-R', -N(R')2, -C(R')^ -C(=X)(R^), C (=X) (-Y-R' ) , 
5 S(=Z)i.2(-Y-R') , in which Z is O, X is O, S or NH, and Y is O, 
S or NH, wherein at least one R is a sterically bulky group. 
Preferred bulky groups contain 3 non-hydrogen atoms or more, 
most preferred bulky groups contain 6 non-hydrogen atoms or 
more and are preferably cyclic and/or aromatic. It will be 

10 apparent from the description of this invention that these 
preferred definitions apply to the case wherein at least one 
R substituent is different from hydrogen. In case 2 or more 
R groups are bulky, the spatial requirements for achieving 
inhibition may be reduced, for example, from 6 atoms to 3 

15 atoms. 



n 
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It is preferred that R groups are acyl groups, 
especially aromatic acyl groups. It is especially preferred 
that the acyl groups be bound to a nitrogen atom at position 
4 of a pyrimidine base. An especially preferred acyl group is 
5 the benzoyl group. 

is preferably selected from H; alkyl, alkenyl or 
alkynyl (each having from 1-50 C atoms) ; aryl, naphthyl, 
biphenyl or tolyl (each having from 6-50 C atoms) . These 
groups may be straight or branched chain, syicoaetric or 

10 asymmetric, chiral or achiral, and may contain one or more 
heteroatoms selected from N, NH, S and O, and may also comprise 
fused aromatic systems. R' may be heterocyclic, including 
pyridyl, imidazolyl, pyrimidinyl, pyridazinyl, quinolyl, 
acridinyl, imidazolyl, pyrrolyl, furanyl, thienyl, isoxazolyl, 

15 oxazolyl, thiazolyl or biotinyl and may be bound or fused to 
any available position. 

R' may be substituted, preferentially with one or 
more lower organic groups (up to 10 carbon-atoms) or 
derivatives thereof which enhance the triplex inhibiting effect 

20 or are otherwise useful herein. These may be groups such as 
alkyl, alkenyl, alkynyl, aryl, naphthyl, biphenyl, tolyl, 
benzyl, and groups such as -N0,-N02, -SO3, -CN, -OH, -SH, -POg^', 
-COOH, -F, -CI, -Br, and -I. 

Compounds of the present invention can be 

25 conveniently prepared according to the methods described in 
WO 92/20702. An especially preferred method of synthesis uses, 
in a first step, the synthesis of the base substituted by the 
sterically bulky substituent, preferably having also attached 
a reactive group and/ or a linker moiety for attachment of the 

3 0 modified base to a monomer ic backbone unit, for example, 
protected N-aminoethylglycine. In a second step, bases are 
attached via the linker moiety to a nitrogen atom at the 
preformed and protected monomer ic backbone unit. In a third 
step, the base-containing monomer is prepared for 

35 oligomer ization with other bases containing monomer ic backbone 
units or an already formed oligomer, e.g. cleaving of 
protecting groups at one end of the backbone unit and/ or 
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activating this end for oligomer ization. In a fourth step, the 
base-containing monomers are oligomerized depending upon the 
sequence requirements for complementarity for duplex formation 
with a complementary nucleic acid. 
5 Preferred monomer ic backbone units that may be 

protected with a protecting group appropriate for the active 
groups during synthesis of the monomeric backbone unit are 
compounds of the general formula: 



r3 




wherein: 

10 is C1-C4 alkyl substituted by -COOP^ -NHP^ -OP* or 

SP*, wherein P* is hydrogen or a protecting group; 

r2 is C1-C4 alkyl substituted by -COOP^, -NHP^ -OP^ or 
SP^, wherein P^ is hydrogen or a protecting group; 

M is a naturally or non-naturally occurring 
15 heterocyclic moiety bound by a linker to nitrogen, said linker 
being 1-3 atoms in length; and 

is a sterically bulky substituent containing at 
least 3 or more non-hydrogen atoms. 

Monomers which are not substituted by R^ are 
20 disclosed in WO 92/20702. In a preferred case, R* contains the 
group -COOP* and R^ contains the group -NHP^, wherein the 
protecting groups (P* and P^) are cleavable under different 
reaction conditions from each other. 

For example, in certain preferred embodiments, 
25 peptide nucleic acid backbones may be employed. Such backbones 
have the general formula (I) : 



Q 




wherein: 



n is at least 2, 

each of L^-L" is independently selected from the group 
5 consisting of hydrogen, hydroxy, (C1-C4) alkanoyl, naturally 
occurring nucleobases, non-naturally occurring nucleobases, 
aromatic moieties, DNA intercalators, nucleobase-binding 
groups, heterocyclic moieties, and reporter ligands, at least 
one of L^-L** being a naturally- or non-naturally-occurring 
10 nucleobase substituted with a sterically bulky group as 
described herein; 

each of C^-C° is (CR*^R^)y where R*^ is hydrogen and R^ is 
selected from the group consisting of the side chains of 
naturally occurring alpha amino acids, or R*^ and R^ are 
15 independently selected from the group consisting of hydrogen, 
(Cj-C^) alkyl, aryl, aralkyl, heteroaryl, hydroxy, (Ci-C^) alkoxy , 
(Ci-Cg) alkylthio, NR^R"* and SR*, where R^ and R"* are as defined 
above, and R^ is hydrogen, (Ci-C^) alkyl, hydroxy-, alkoxy-, or 
alkylthio- substituted (Cj-C^) alkyl, or R*^ and R^ taken together 
2 0 complete an alicyclic or heterocyclic system; 

each of D^-D° is (CR^R^)^ where R^ and R^ are as defined 

above ; 

each of y and z is zero or an integer from 1 to 10, 
the sum y + z being greater than 2 but not more than 10; 
25 each of G^-ff*"^ is -NR^CO-, -NR^CS-, -NR^SO- or -NR^SOj-, 

in either orientation, where R^ is as defined above; 

each pair of A^-A° and B^-B'' are selected such that: 
(a) A is a group of formula (Ila) , (lib) or (lie) 
and B is N or R^N"*"; or 
30 (b) A is a group of formula (lid) and B is CH; 
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(Ila) 



0 

II 

H C- 



(Ilb) 



1^1 I 

-C N- 



(IIC) 



(Ild) 



where : 

5 X is O, S, Se, NR^, CHj or 0(^3)2; 

Y Is a single bond, O, S or NR"*; 

each of p and q Is zero or an Integer from 1 to 5, 
the sum p+q being not more than 10; 

each of r and s Is zero or an Integer from 1 to 5, 
10 the sum r+s being not more than 10; 

each R^ and R^ Is Independently selected from the 
group consisting of hydrogen, (Ci-C4)alkyl which may be hydroxy- 
or alkoxy- or alkylthio-substituted, hydroxy, alkoxy, 
alkylthio, amino and halogen; 
15 each ofAG^-ff*' is -NR^CO-, -NR^CS-, -NR^SO- or -NR^S02-, 

in either orientatojSn^ where R^ is as defined above; 

Q is -CO2H, -CONR'R", -SO3H or -S02NR'R" or an 
activated derivative of -COjH or -SO3H; and 

I is -NHR"'R'''' or -NR' ' 'C (O) R' " S where R' , R" , 
20 R' ' ' and R' ' ' ' are independently selected from the group 
consisting of hydrogen, alkyl, amino protecting groups, 
reporter ligands, intercalators, chelators, peptides, proteins, 
carbohydrates, lipids, steroids, oligonucleotides and soluble 
and non-soluble polymers. 
25 In certain embodiments, at least one A is a group of 
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formula (lie) and B is N or R^N^. In other embodiiaent^s , A is 
a group of formula (Ila) or (lib), B is N or R^N^, and at least 
one of y or 2 is not 1 or 2 . 

Some preferred peptide nucleic acids have general 



5 formula (Ilia) or (lllb) : 




(Ilia) 




(Illb) 

wherein: 

each L is independently selected from the group 
10 consisting of hydrogen, phenyl, heterocyclic base moieties, 
including those substituted with a sterically bulky group or 
groups, naturally occurring nucleobases, and non-naturally 
occurring nucleobases; 

each R^* is independently selected from the group 
15 consisting of hydrogen and the side chains of naturally 
occurring alpha amino acids; 
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n is an integer from 1 to 60; 

each of k, 1, and m is independently zero or an 
integer from 1 to 5; 

p is zero or 1; 
5 is OH, NH2 or -NHLysNHj; and 

R' is H or COCH3. 

Particularly preferred are compounds having formula 
(Ilia) or (Illb) wherein each L is independently selected from 
the group consisting of the nucleobases thymine (T) , adenine 

10 (A) , cytosine (C) , guanine (G) and uracil (U) , especially where 
one or more are modified with a sterically bulky substituent 
in accordance with this invention, k and m are zero or 1, and 
n is an integer from 1 to 30, in particular from 4 to 20. 

The peptide nucleic acids of the invention can be 

15 synthesized by adaptation of standard peptide synthesis 
procedures, either in solution or on a solid phase. The 
synthons used are specially monomer amino acids or their 
activated derivatives, protected by standard protecting groups. 
The oligonucleotide analogs also can be synthesized by using 

20 the corresponding diacids and diamines. 

Thus, monomer synthons useful for incorporation into 
PNA of the preceding formulae include those selected from the 
group consisting of amino acids, diacids and diamines, having 
general formulae: 



f t ! 



0 r 



or '^^^^^ 



25 (IV) (V) (VI) 



wherein L, A, B, C and D are as defined above, except that any 
amino groups therein may be protected by amino protecting 
groups; E is COOH, CSOH, SOOH, SOjOH or an activated derivative 
thereof; and F is NHR^ or NPgR^, where R^ is as defined above 
3 0 and Pg is an amino protecting group. 

Preferred monomer synthons according to the invention 
include those having formula (Villa) - (VIIIc) : 



)3 
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or aittino-protected and/ or acid terminal activated derivatives 
thereof, wherein L is selected from the group consisting of 
hydrogen, phenyl, heterocyclic moieties, naturally occurring 
nucleobases, and non-naturally occurring nucleobases; and R^' 
is selected from the group consisting of hydrogen and the side 
chains of naturally occurring alpha amino acids. 

Also useful in the present invention are chiral PNA 
backbones. Such backbones are preferably derived from two or 
more monomers, at least one of which contain a aliphatic cyclic 
structure. Representitive of such monomers are, fcte&aof 
formula: 



HzN 




OH 



wherein: 

B is a naturally or non-naturally occurring 
nucleobase which may be substituted with a sterically bulky 
15 group in accordance with this invention; 

at least one of Ca or CjS is in the S configuration; 

and 

nisO, 1, 2, or 3. 

In preferred embodiments Ccr and CjS are in the S 
20 configuration. In further preferred embodiments of the 
invention B is adenine, cytosine, guanine, thymine, or uracil. 
In more preferred embodiments n is 2 . 

In further preferred embodiments the peptide nucleic 
acid oligomers contain at least one peptide nucleic acid 
25 monomer having a ( 2 -aminoethyl) glycine backbone with a chiral 



is 
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center in the ethyl portion of the backbone. The monomer is 
incorporated into peptide nucleic acid oligomers at a position 
corresponding to a region of variability in the target 
molecule. 

5 One nucleic acid mimic can contain one or more 

nucleobases modified as described above. It was found that 
increasing the number of nucleobases containing sterically 
bulky substituents within one nucleic acid mimic inhibited 
triplex formation while retaining the ability to form duplexes. 

10 In order to achieve the inhibition of triplex 

formation, the nucleic acid mimic and the position of 
attachment of the sterically bulky group are chosen such that 
the heterocyclic bases to which the sterically bulky 
substituent is attached would be located in close proximity to 

15 each other when bound to the nucleic acid, were a triplex to 
form. Preferably the substituted bases on the nucleic acid 
mimics should, in the hypothetical triplex, be located on the 
same side, i.e. base pairing to the same nucleobase of the 
nucleic acid strand. This case wherein the substituted bases 

20 of the mimic would base pair to the same base on the nucleic 
acid strand will be termed as "opposed". That the substituted 
bases would have to base pair with a predefined base on the 
nucleic acid strand can be achieved by choosing the base 
sequence and orientation of the mimics such that only the 

25 triplex formation could occur in a way which is inhibited by 
the use of the sterically bulky substituents. 

The compounds of the present invention can be used 
in methods for the determination of a nucleic acid comprising 
a nucleic acid mimic substituted at positions which are 1, 2 

30 or 3 atoms removed from the atom of the base which is attached 
to the backbone, incubating said nucleic acid mimics and said 
nucleic acid under conditions suitable for the formation of a 
duplex between said nucleic acid mimic and said nucleic acid 
and determining the occurrence of said duplex as a measure of 

35 the occurrence of said nucleic acid. These methods are 
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believed to function according to the principles described in 
WO 92/20703 (herein incorporated by reference) by replacing the 
compounds used in the prior art with the compounds described 
herein above. It is especially preferred to use a nucleic acid 
5 mimic which is labeled with a reporter group either at one of 
the termini of the nucleic acid mimic or at any position of the 
backbone or the base moieties. A reporter group according is 
a group that can be detected, for example a fluorescent group 
like fluorescein, or one which can be detected by a further 

10 compound which is bound in a subsequent step to the reporter 
group. For example, if the sterically bulky substituent is a 
biotin group or a group containing a biotin group, the nucleic 
acid mimic, and thereby the nucleic acid can be determined by 
adding detectable streptavidin to the hybrid. It is preferred 

15 to remove any excess biotin-labeled nucleic acid mimic from the 
mixture prior to this incubation. The reporter group is then 
detected by means which are known to the art-skilled. 

The present invention is suitable for detection of 
expression of a disease-causing protein in a cell or tissue 

20 sample from patients who have a disease state. A number of 
assays may be formulated for the inhibition of protein 
expression employing the present invention, which assays will 
commonly comprise contacting a cell or tissue sample with a 
nucleic acid mimic of the invention under conditions selected 

25 to permit detection, and usually quantitation, of such 
inhibition. As described below, f luorescein-labeled nucleic 
acid mimics are prepared and contacted with a cell or tissue 
sample suspected of expression of a disease-causing protein. 
The sample is then washed to remove unbound nucleic acid mimic. 

30 Fluorescence remaining in the sample, detected and quant itated 
by fluorimetry, indicates bound nucleic acid mimic (which in 
turn indicates the presence of nucleic acid encoding the 
disease-causing protein) . 

The compounds of the present invention may be useful 

35 in binding to target molecules. Target molecules of the 
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present invention can include any of a variety of biologically 
significant molecules. Such target molecules may be nucleic 
acid strands such as significant regions of DNA or RNA which 
encode proteins that may be responsible for causing and/ or 
5 maintaining a disease state in mammals* Such other target 
molecules may be transcription factors. Target molecules can 
be carbohydrates, glycoproteins or other proteins. In some 
preferred embodiments, the target molecule can be a protein 
such as an immunoglobulin, receptor, receptor binding ligand, 

10 antigen or enzyme, and more specifically can be a 
phospholipase, tumor necrosis factor, endotoxin, interleukin, 
plasminogen activator, protein kinase, cell adhesion molecule, 
lipoxygenase, hydrolase or transacylase. In other embodiments 
of the invention, the target molecule may be an important 

15 region of the human immunodeficiency virus, Candida, herpes 
viruses , papillomaviruses , cytomegalovirus , rhinoviruses , 
hepatitis viruses or influenza viruses. In yet other 
embodiments of the invention, the target molecule may be a 
region of an oncogene . 

20 The following examples further illustrate the 

invention and are not intended to limit the same. 

EXAMPLES 
EXAMPLE 1 

A. Exemplary General Syntheses 

25 r nOS^O %^^^\ M^ftritn^n t o r were purchased from Cruachem (UK) and 



the DNA oligomers were assembled on a MilliGen/Biosearch 8700 
DNA synthesizer. The A, C, G and T PNA monomers were purchased 
from Biosearch (USA). N' -Boc-aminoethyl glycine was purchased 
from Biosearch (USA) . All PNA oligomers were synthesized on 
30 a custom-made PNA synthesizer (Biosearch, USA) by a modified 
Merrif ield method (Christensen, L. , Fitzpatrick, R. , Gildea, 
B., Warren, B. and Coull, J. (1994), Innovations and 
Perspectives in Solid Phase Synthesis, R. Epton, Ed., SPCC (UK) 
Ltd., Oxford, England; Christensen et al., (1995), J. Pep. 
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Sci., 3, 175) and purified by reverse phase-HPLC. The PNA 
oligomers were characterized by FAB^MS. 

B. T^ Measurements 

Absorbance versus temperature was measured at 2 60 nm 
5 using a Guilford Response spectrophotometer. Heating rate was 
O.S^'C/min from 5-90®C. PNA oligomers were hybridized with 
complementary DNA sequences in a medium salt buffer containing 
100 mM NaCl, 10 mM sodium phosphate and 0.1 mM EDTA, pH was 
adjusted to 5, 7 or 9, as desired. The samples were heated to 
10 90®C for 5 min, slowly cooled to 20^ and left at 4*^0 for 30 min 
prior to T„ measurements. 

C. 8ynthesis of modified cytosine monomer 

(i) Benzoyl cytosin-l-ylacetate (1) 

Reference is made to Figure 1 where to cytosine (20 
15 g, 0.18 mol) in 400 mL DMF was added 7.2 g (0.18 mmol) of NaH 
(disp. in oil 60%) . The mixture was heated to 50**C and stirred 
for 2 h under nitrogen. After cooling to room temperature, 29 
mL (1.1 eq.) of benzyl bromoacetate was added over 2 h. After 
stirring overnight, the dark suspension was filtered and the 
20 filtrate washed with cold DMF and 0.2 M sodium bicarbonate. 
The product (1) was crystallized from ethanol. Yield: 37 g 
(79%). NMR (d^-DMSO) : S 4.56 (s, 2 H, CHjO) , 5.24 (s, 2 H, 

CHjCO) , 5.77 (d, 1 H, H5) , 7.20 (dd, 2 H, NHj) , 7.45 (m, 5 H, 
aromatic), 7.65 (d, 1 H, H^) . MS (FAB) m/z 260 (M+H) 
25 (calcd 260) . 

(ii) (M^- (Benzoyl) cytosin-1 yl) acetic Acid (2) 

To a solution of (1) (10 g, 38 mmol) in 10 mL 
pyridine was added 6.6 g (47 mmol) of benzoyl chloride and 
stirred overnight at room temperature. The solution was 

3 0 evaporated under reduced pressure. The residue was dissolved 
in 1 M KOH and stirred for 3 h after which the Ph was adjusted 
to 2 with cone. HCl. The target compound (2) precipitated out. 
Yield: 9.3 g (90%). *H NMR (d^-DMSO) : 5 4 . 59 (s, 2 H, CHjO) , 
7.31 (d, 1 H, H5) , 7.5-8.2 (7 H, aromatic, NH, H<s) . MS (FAB) 

35 m/z 273 (M+H) ''^ (calcd 273). 
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(iii) N- ( (N^- (Benzoyl) cytosin-l-yl) acetyl) -N- (2-Boc- 
aminoethyl) glycine (3) 

4.8 g (22 mmol) of Methyl N- (2-Boc-aminoethyl) - 
glycinate (2), 2.4 g (14.7 mmol) of benzyloxycarbonyl chloride, 
5 2*.9 g (14.9 mmol) of DCC and 2.4 g (14.7 mmol) of DhBtOH was 
dissolved in 50 mL of DMF and stirred for 4 h at room 
temperature. Dichloromethane (100 mL) was added and the 
mixture extracted with 3 x 0.2 M sodium bicarbonate, 2 x 1 M 
sodium hydrogen sulfate and brine. The organic phase was dried 

10 with magnesium sulfate and evaporated to dryness under reduced 
pressure. The residue was dissolved in 2 M KOH and stirred for 
1 h after which the pH was adjusted to 2 with 1 M HCl, whereby 
the target compound precipitated. The product (3) was 
crystallized from methanol: ethyl acetate: hexane (1:2:2). 

15 Yield: 4.2 g (60%). *H NMR (dg-DMSO) : S 1.45 and 1.47 (d, 9 H, 
Hoc), 3.28-3,53 (m, 4 H, CH2) , 4.08 and 4.31 (s, 2 H, CH2CO) , 
4.75 and 4.95 (s, 2 H, CH2CO) , 6.83 and 7.03 (m, 1 H, BocNJJ) , 
7.38 (m, 1 H, H5) , 7.57-8.10 (m, 6 H, aromatic and H^) . MS (FAB) 
m/z 474 (M+H)+ (calcd 474). 



20 EXAMPLE 2 

Triplex inhibition 

The effect of the benzoylated cytosine (C®^) residue 

on the hybridization properties of a homopyrimidine^ep^^ 

nucleic acid was studied. PNAl, H-TTTTCCTCTC-LysNHz,, warfe 

A. 

25 synthesized containing either C^^ in position 6 (PNA2) , or two 
C®^ residues in positions 6 and 8 (PNA3) or in positions 5 and 
6 (PNA4) . These PNAs were hybridized to a complementary 
oligonucleotide in the parallel (ODNl) or the antiparallel 
(ODN2) configuration and the thermal stability (T^) of the 

3 0 resulting complexes was determined at pH 5-9. The results are 
set forth in Table 1. Absorbance versus temperature curves 
were measured at 260 nm in 100 mM NaCl, 10 mM sodium phosphate 
and 0.1 mM EDTA. Heating rate: 0.5Vminute at 5-90*^0. The T^s 
in parentheses were obtained by cooling from 90"* to 10°C while 

70 
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TABLE 1 

Melting temperatures T„ (**C) for binding 
of PNA to single stranded homopurine DNA 
oligomer. 





ODNl 


0DN2 


pH 




PNA3 


58.5 (31.0) 
26. 0 


5 






40.5 
33.5 


7 
9 


PNA2 


56.0 (38.0) 
27.0 (20.0) 


54.0 (42.5) 
32.0 (29.0) 
31.0 (29.0) 


5 
7 
9 


PNA3 


28.0 


33.0 


7 


PNA4 


26.0 


32.5 


7 



Ollgodeoxynucleo'tldes : 

ODNl = 5 ' -AAAAGGAGAG-3 ' A -r^ -j-K 11/0 ' o 

ODN2 = 5 ' -GAGAGGAAAA- 3 ' JJ> ' ^ 

Nucleic acid mimics: 

PNAl = H-TTTTCCTCTC-LysNH2^,S62^ 3i> 3 

PNA2 = H-TTTTCC^^CTC-LysNfvA. 

PNA3 = H-TTTTCC^^C^^C-LysNrij^ 

PNA4 = H-TTTTC*^C*^CTC-LysNH2 



Unmodified PNAl exhibited expected behaviour. First, 
pronounced pH dependence was observed which is compatible with 
PNAj-DNA triplex formation requiring cytosine protonation. 
Second, the parallel complex showed highest stability at pH 5 
and 7, but not at pH 9. These results suggest that triplexes 
are the most stable complexes at pH 5 and 7, while the 
(antiparallel) duplex is more stable at pH 9. Triplex 
formation at pH 7 is also consistent with pronounced hysteresis 
(« 27 ®C) observed at this pH. 

PNA2, containing one C®^ residue, apparently also 
formed a triplex at pH 5 as judged by the hysteresis, but the 



5/ 
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was lower (« 3 0°C) than that of the PNAl complex. Thus, the 
benzoyl groups do Indeed appear to interfere with efficient 
triplex formation. This effect is especially pronounced at 
pH ?• Only slight hysteresis is observed and notably the 
5 antiparallel complex shows higher stability which does not 
decrease at more alkaline conditions (pH 9) . These results 
strongly argue in favour of the duplex being the most stable 
complex at pH 7 with this PNA. 

The complexes with PNAl and PNA2 showed equal thermal 
10 stability at pH 9, i. e, for the duplex, thus indicating that 
the C®^ residue does not interfere with Watson-Crick base 
pairing in the PNA-DNA duplex. This conclusion was supported 
by experiments with a C®^ containing mixed puri^e/pyrii^di^e 
^ sequence using the PNA oligomers H-^^T^^AC^CTA C-'^s^^ (PNAS)^ 

^ 15 and H-AGT CA C®^ CTA C-LysNH^^^NA6) , 'and is set forth in 

Table 2 . Absorbance versus temperature curves were measured 
at 260 nm in 100 mM NaCl, 10 mM sodium phosphate and 0.1 mM 
EDTA, at pH 7. Heating rate: 0.5%/min at 5-90 **C. The T^s in 
parentheses were obtained by cooling from 90 to 10**C while 
20 measuring the absorbance at 260 nm. The hysteresis of the 
system is the difference between the T„ (10-90**) and (90- 
10^) . 




# 
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Melting temperatures T„ ("C) for binding 
of PNA in duplex mode to single-stranded 
DNA oligomer. 





PNA5 


PNA6 


ODN3 


49 (48) 


50 


ODN4 


33 (31) 


34 



O 1 igodeoxynucleo 



:1 



ODN3 = 5'-GTAGGTCACT-3X^ 
ODN4 = 5'-GTAGATCACT-3j^ 
10 Nucleic acid mimics: _ 
PNA5 = H-AGTCACCTAC-LysNH^ 
PNA6 = H-AGTCAC^^CTAC-LysNHjyV^ 



Both of these oligomers form highly stable duplexes 
with their antiparallel oligonucleotide target. The 

15 stoichiometry of these complexes was determined by Job-plots 
as 1:1 complexes in both cases. The insignificant difference 
in TnjS of the complexes between PNA5 and PNA6 with ODN3 falls 
within experimental error and can be interpreted as evidence 
of the structure shown in Figure 2 , by which the benzoyl group 

2 0 is positioned in the major groove not interfering with the 
Watson-Crick base pairing. This is also in full agreement with 
the (G-^A) mismatch positioned opposite the cytosine in the DNA 
strand, giving rise to a drop in T^, of 15-16*' for both PNA5 and 
PNA6. An important feature distinguishing duplexes from 

25 triplexes under the experimental conditions is the very small 
hysteresis (less than 2®) obtained with duplexes when going 
from high to low temperature, whereas PNA: DNA triplexes showed 
pronounced hysteresis typically in the range of 20-30® (Table 
2) . This is also evident for the complexes between PNA6 and 
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ODN3 or ODN4 in which a hysteresis of 1-2 ®C was observed. The 
small hysteresis obtained with PNA6 also indicated that the 
benzoyl group does not interfere significantly with the binding 
kinetics. 



5 EX2VMPLE 3 

Coupling of nucleic acid mimic to fluorescein 

A nucleic acid mimic having a free amine moiety is 
dissolved in THFiHjO to provide a solution that is 0.1 M of 
nucleic acid mimic. To the nucleic acid mimic solution is 
10 added fluorescein isothiocyanate, providing a solution that is 
0. 1-1. 0 M in fluorescein isothiocyanate. The resultant 
reaction mixture is stirred for 0.1-2 hours and concentrated 
under reduced pressure. The residue is purified by preparative 
HPLC. 



15 EXAMPLE 4 

Detection of mutant j^-amyloid precursor protein gene expression 
(^APP) 

Point mutations in the gene encoding jS-amyloid have 
been implicated in familial Alzheimer's disease (FAD) • Nucleic 

20 acid mimics are labeled with fluorescein or other fluorescent 
tags , as illustrated in Example 3 above . The f luorescently- 
labeled nucleic acid mimics are contacted with a cell or tissue 
sample suspected of abnormal jSAPP expression under conditions 
suitable for specific hybridization of the nucleic acid mimic 

25 to the nucleic acid encoding abnormal )3app. The sample is then 
washed to remove unbound nucleic acid mimics. Label remaining 
in the sample indicates bound nucleic acid and is quantitated 
using a fluorimeter, fluorescence microscope or other routine 
means . 
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A first sample of cells or tissues suspected of 
expressing a point mutation in the )8APP gene is incubated with 
a f luorescein-labeled nucleic acid mimic which is targeted to 
the mutant codon 717, codon 670 or codon 671 of the jSAPP mRNA, 
5 A second identical sample of cells or tissues is incubated with 
a second labeled nucleic acid mimic which is targeted to the 
same region of normal jSAPP mRNA under conditions in which 
specific hybridization can occur. The sample is then washed 
to remove unbound nucleic acid mimic. Label remaining in the 
10 sample indicates bound nucleic acid and is quantitated using 
a fluorimeter or other routine means. The presence of mutant 
iSAPP is indicated if the first sample retains labeled nucleic 
acid mimic and the second sample does not retain labeled 
nucleic acid mimic. 



15 EX2^PLE 5 

Detection of mutant H-r-as gene expression 

Point mutations in the H-ras gene have been 
implicated in numerous aberrations of the ras pathway. Nucleic 
acid mimics are labeled with fluorescein or other fluorescent 

20 tags as illustrated in Example 3 above. Labeled nucleic acid 
mimics are contacted with cell or tissue samples suspected of 
abnormal rets expression under conditions in which specific 
hybridization can occur. The sample is then washed to remove 
unbound labeled nucleic acid mimic. Label remaining in the 

25 sample indicates bound nucleic acid (i.e. that which encodes 
for mutant ras) and is quantitated using a fluorimeter, 
fluorescence microscope or other routine means. 

A first cell or tissue sample suspected of expressing 
a point mutation in the H-iras gene is incubated, under 
30 conditions suitable for specific hybridization, with a 
f luorescein-labeled nucleic acid mimic which is targeted to 
codon 12, codon 13 or codon 61 of mutant H- ras mRNA. A second 
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identical sample of cells or tissues is incubated, under 
conditions suitable for specific hybridization, with a second 
f luorescently-labeled nucleic acid mimic which is targeted to 
the same region of normal H-ras mRNA. The samples are then 
5 washed to remove unbound labeled nucleic acid mimics. Label 
remaining in the sample indicates bound nucleic acid and is 
quantitated using a fluorimeter or other routine means. The 
presence of mutant H-ras is indicated if the first sample 
exhibits fluorescence but the second sample does not. 



10 EXAMPLE 6 

Inhibition of gene expression by nucleic acid mimics 



mimics to inhibit expression of the E2 mRNA of papillomavirus 
is based on the well-documented transactivation properties of 

15 E2 . Spalholtz et al . , J. Virol., 51, 2128 (1987). A reporter 
plasmid (E2RE1CAT) is constructed to contain the E2 responsive 
element, which functions as an E2 -dependent enhancer. E2RE1CAT 
also contains the SV40 early promoter, an early polyadenylation 
signal and the chloramphenicol acetyl transferase (CAT) gene. 

20 Within the context of this plasmid, CAT expression is dependent 
upon expression of E2. The dependence of CAT expression upon 
the presence of E2 is tested by transfection of this plasmid 
into C127 cells transformed by BPV-1, uninfected C127 cells and 
C127 cells cotransf ected with E2RE1CAT and an E2 expression 

2 5 vector . 



transformed C127 cells are plated in 12-well plates. Twenty 
four hours prior to transfection with E2RE1CAT, cells are 
pretreated by the addition of complementary nucleic acid mimic 
3 0 to the growth medium at a final concentrations of 5, 15 and 30 
mM. The next day, cells are transfected with 10 fig of E2RE1CAT 
by calcium phosphate precipitation. E2RE1CAT (10 fig) and 



A preferred assay to test the ability of nucleic acid 



Inhibition of BPV-1 E2 expression: 



BPV-1 
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carrier DNA (PUC 19, 10 ^g) are mixed with 62 of 2 M CaClj 
in a final volume of 250 /aL of HjO, followed by the addition of 
250 /iL of 2X HBSP (1.5 mM Na2P04, 10 mM KCl, 280 mM NaCl, 12 mM 
glucose and 50 mM HEPES, pH 7.0) and incubated at room 
5 temperature for 30 minutes. This solution (100 ^L) is added 
to each test well and allowed to incubate for 4 hours at 37 °C. 
After incubation^ the cells are glycerol shocked for 1 minute 
at room temperature with 15% glycerol in 0.75 mM Na2P04, 5 mM 
KCl, 140 mM NaCl^ 6 mM glucose and 25 mM HEPES, pH 7.0. After 
10 shocking, the cells are washed 2X with serum-free DMEM and 
refed with DMEM containing 10% fetal bovine serum and nucleic 
acid mimic at the original concentration. Forty eight hours 
after transf ection, the cells are harvested and assayed for CAT 
activity. 

15 For determination of CAT activity, cells are washed 

2X with phosphate-buffered saline and collected by scraping. 
Cells are suspended in 100 /xL of 250 mM Tris-HCl, pH 8.0, and 
disrupted by f reeze-thawing three times. This cell extract (25 
/xL) is used for each assay. 

20 For each assay, the following are mixed together in 

a 1.5 mL Eppendorf tube and incubated at 37 ®C for one hour: 
25 /iL of cell extract, 5 /xL of 4 mM acetyl coenzyme A, 18 ftL 
of H2O and 1 filj of ^"^C-chloramphenicol , 40-60 mCi/mM. After 
incubation, chloramphenicol (acetylated and non-acetylated 
25 forms) is extracted with ethyl acetate and evaporated to 
dryness. Samples are resuspended in 25 /xL of ethyl acetate, 
spotted onto a tic plate and chromatographed in 
chloroform: methanol (19:1). The chromatographs are analyzed 
by autoradiography. Spots corresponding to acetylated and non- 
30 acetylated ^"^C-chloramphenicol are excised from the tic plate 
and counted by liquid scintillation for quantitation of CAT 
activity. Nucleic acid mimics that depress CAT activity in a 
dose-dependent manner are considered to have a positive effect. 



35 inhibition of human papillomavirus (HPV) E2 by nucleic acid 



D. Inhibition of HPV E2 expression: The assay for 
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mimics is essentially the same as that for BPV-1 E2 . For HPV 
assays, appropriate HPVs are cotransf ected into either CV-1 or 
A431 cells with PSV2NEO using the calcium phosphate method 
described above. Cells which take up DNA are selected for 
5 culturing in media containing the antibiotic G418. G418- 
resistant cells are then analyzed for HPV DNA and RNA, Cells 
expressing E2 are used as target cells for complementary 
studies. For each nucleic acid mimic, cells are pretreated as 
above, transf ected with E2RE1CAT and analyzed for CAT activity 
10 as described above. Nucleic acid mimics are considered to have 
a positive effect if they can depress CAT activity in a dose- 
dependent manner. 





